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CN ■ Abstract 

F ^ , The well known Hopcroft's algorithm to minimize deterministic complete au- 

Ph I tomata runs in 0(A:nlogn)-time, where k is the size of the alphabet and n the 

number of states. The main part of this algorithm corresponds to the computation 
of a coarsest bisimulation over a finite Deterministic Labelled Transition System 
(DLTS). By applying techniques we have developed in the case of simulations, we 
■ design a new algorithm which computes the coarsest bisimulation over a finite DLTS 

t^H , in 0(mlogra)-time and 0(£; + m + n)-space, with m the number of transitions. The 

underlying DLTS does not need to be complete and thus: m < kn. This new 
algorithm is much simpler than the two others found in the literature. 

> ■ 

o\ 1 Introduction 

oo 

^ I Bisimulation and simulation equivalences are behavioral relations between processes. They 
; are mainly used to reduce the state space of models since they preserve branching time 
temporal logic like CTL and CTL* for bisimulation [4] and their existential fragment for 
simulation [B]. Simulation can also be used as a sufficient condition for the inclusion of 
languages when this test is undecidable in general [3]. 

Let us call coarsest bisimulation problem, the problem of finding the coarsest bisimula- 
tion equivalence in a Labelled Transition System (LTS) and coarsest simulation problem 
the corresponding problem for simulation. One can establish a hierarchy, with increas- 
ing difficulty, from the coarsest bisimulation problem in a finite deterministic LTS, to 
the coarsest simulation problem in a finite non deterministic LTS. The first case was 
efficiently solved by Hopcroft, to minimize finite deterministic automata, in [7] with an 
algorithm whose complexities are O (kn log n)-time and 0(fcn)-space [8], with k the size of 
the alphabet and n the number of states. The next important step, the coarsest bisimu- 
lation problem for finite non deterministic LTS, was partially solved by Paige and Tarjan 
in [H]. We say partially because it was for k = 1. The complexities of their algorithm are 
0(m log ro)-time and 0(m + n)-space with m the number of transitions. An extension to 
the general case k > 1 was proposed later by Fernandez [5]. According to [11] the time 
complexity of the algorithm of Fernandez is O (kn log n). 
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In the present paper, by using techniques we have developed for the coarsest simulation 
problem in [2] , we design now a new algorithm to avoid the k factor in the time complexity 
of the coarsest bisimulation problem over finite DLTS. We do this in order to obtain a 
time complexity of O(mlogn) and a space complexity of 0(k + n + m). 

In the literature, there are two papers which achieve the same result over finite DLTS, 
both are designed to minimize finite deterministic automata. The first one [TT] . by 
Valmari and Lehtinen, may be considered as the best solution using the ideas of Hopcroft, 
in the sense that conceptually their splitters are couples made of a block and of a letter. 
However, they need, beside the traditional partition of the states during the computation, 
a partition of the set of transitions. The second one, by Beal and Crochemore [T] is closer 
to our solution but is more complex with its use of "all but the largest strategy" and 
"signature of states" . 

2 Preliminaries 

Let Q be a set of elements. The number of elements of Q is denoted \Q\. A binary relation 
on Q is a subset of Q x Q. In the remainder of this paper, we consider only binary relations, 
therefore when we write "relation" read "binary relation". Let 3% be a relation on Q. For 
X,Y C Q, we note X MY to express the existence of two states q,q' G Q such that 
(q, q') G X x Y C\M. By abuse of notation, we also note q&Y for {q} &Y , X & q' for 
XM{q'} and q&q' for {q} & {q'}. In the figures we draw X---S -->Y for X £%Y . We 
note &$~ x the inverse of 2% such that q&~ l q' iff q' M q. We define M(q) = {q' G Q\q&q'} 
for q G Q and M(X) = U qe x&(q) for ICQ, Let ,Y be another relation on Q, the 
composition of M by y is 5? o 3& = {(x,y) E Q x Q\y e S?(M(x))}. The relation 
ffl is said reflexive if for all q G Q, we have qMq. The relation ^ is said symmetric if 
= e^. The relation ^ is said transitive if ^ o M C An equivalence relation is 
a reflexive, symmetric and transitive relation. For an equivalence relation ^ on Q and 
q E Q we call ^(g) a the block of q. It is well known, and immediate consequences 

of the definitions, that for any equivalence relation & and any block B of 3?, we have 
Vg, q' G B .qM q' , and for any ICQ, <^(X) is a, disjoint, union of blocks. 

Let X be a set of subsets of Q, we note UX = VJ BeX B. A partition of Q is a set of 
non empty subsets of Q, also called blocks, that are pairwise disjoint and whose union 
gives Q. There is a duality between a partition of Q and an equivalence relation on Q 
since from a partition P we can derive the equivalence relation Mp = UsepB x and 
from an equivalence relation 3? we can derive a partition = U{£$(q) C Q | g G Q}- 

Let T = (Q, S, — >■) be a triple such that Q is a set of elements called states, S is an 
alphabet, a set of elements called letters or labels, and ->C Q x E x Q is a transition 
relation or sei of transitions. Then, T is called a Labelled Transition System (LTS). From 
T, given a letter a G S, we define the two following relations on Q: — >= {(q,q') G 

Q x Q | (g, a, <?') G— )•} and its reverse pre_^ = (A) . When — >■ is clear from the context, 

we simply note pre a instead of pre_^. Finally, if for all a G E, the relation A- is a function 

(i.e. Va G S Vg G Q . |— >(<z)| < 1) then T is said deterministic and thus a DLTS. 
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3 Underlying Theory 



The presentation of this section is quite different from what is found in the literature. 
Thanks to the results it provides, I hope the reader will be convinced of its pertinence. 

Let T = (<5,S,— >) be a LTS. The classical definition of a simulation says that a 
relation y C Q x Q is a simulation over T if for any transition q\ — > q[ and any state 
q2 G Q such that q\y q 2 , there is a transition g 2 — >■ q' 2 such that q[y q' 2 - However, 
the following definition happens to be more effective than the classical one. As put in 
evidence by the picture on the right, the two definitions are clearly equivalent. 

Definition 1. Let T = (Q, £,—)•) be a LTS and y be a relation 
on Q. The relation y is a simulation over T if: 

Va G S . y o pre a C pre a o y . 

For two states q, q' G Q, we say "q is simulated by q'" if there is a 
simulation y over T such that qy q'. 

A bisimulation y is just a simulation such that y~ x is also a simulation. For a 
bisimulation 5?, two states q and q' such that qy q' are said bisimilar. 

The main idea to obtain efficient algorithms is to consider relations between blocks of 
states and not merely relations between states. Therefore, we need a characterization of 
the notion of bisimulation expressed over blocks. 

Proposition 2. Let T = (Q, E, — >) be a LTS and y be an equivalence relation on Q. 
The relation y is a bisimulation over T if and only if for all block B of y we have: 
VflGS.yo pre a (5) C pre a (S). 

Proof. If y is a bisimulation, by definition it is a simulation. Then, we have for any 
B C Q: Va 6 E . y o pre a (5) C pre a (y(B)). This inclusion is thus also true for a block 
B = S^{q) for a given q G Q. With the transitivity of an equivalence relation, we get 
y(B) = 5? o y(q) = y(q) = B. All of this implies: Va G S . & o pre a (B) C pre a (B). 

In the other direction, let us consider a given q G Q. Let B = y{q). An equivalence 
relation is reflexive. Therefore, we get: q G B. Then, 

y o pre a (g) C y o pre a (5) C pre a (B) = pre a o y(q) . 

Which implies that y is a simulation. Since y is an equivalence relation we have 
y = y~ x and thus y~ x is also a simulation. This ends the proof. □ 

Let ffl be an equivalence relation on Q. Thanks to the preceding proposition, if 3% is 
not a bisimulation over T, there is a G X and B a block of Si such that Z% o pre a (B) (£ 
pre a (B). This implies the existence of a subset of Q, let us call it Remove a (5), such that: 

^ o pre a (5) C pre a (S) U Remove a (S) . (1) 

The problem with ([1]) is that Remove a (5) depends on a letter which we want to avoid 
in order to obtain an algorithm whose complexity does not depend on the size of the 
alphabet. It would therefore be more interesting to have something like: 

& o pre a ( J B) C pre a (5 U NotRel(B)) . (2) 
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But, this is possible only if & o pre a (S) C pre (Q). A sufficient condition is: 

VaeS.^opre a (Q) Cpre a (Q) . (3) 

The fact is that fl3]) is not a real restriction because any bisimulation included in M 
satisfies this condition: a state which has an outgoing transition labelled by a letter a 
can be bisimilar only with states which have at least one outgoing transition labelled by 
a. This is exactly what 03) says about Surprisingly, all of this is well known, but we 
have found no algorithm that uses this as an optimization during an initialization phase. 
For the present paper this not an optimization, this is a necessity. 

The next definition and lemma establish that we can restrict our problem of finding 
the coarsest bisimulation inside an equivalence relation & to the problem of finding the 
coarsest bisimulation inside an equivalence relation & that satisfies (j3J). 

Definition 3. Let T = (Q, £, — >) be a LTS and 3? be an equivalence relation on Q. We 
define InitRefine(^) C M such that: 

(q, q') G InitRefine(«^) <^> (q, q') G ffi A Va G E (q G pre a (Q) q' G pre a (Q)) . 

Lemma 4. Let T = (Q, E, — >) be a LTS and % = InitRefine(^) with & an equivalence 
relation on Q. Then, % is an equivalence relation on Q and all bisimulation over T 
included in M is included in % . 

Proof. 

• Since is an equivalence relation and thus reflexive, % is also trivially reflexive. 
Its definition being symmetric, % is also symmetric. Now, let us suppose is 
not transitive. There are three states gi, q 2 , g 3 G Q such that: q\ % g 2 A g 2 % g 3 A 
-i gi % g 3 . From the fact that C 3% and Si is an equivalence relation, we get 
qi 3? q%. With -< qi g 3 and the definition of there is a G E such that only one of 
{<7i> #3} belongs to pre a (Q). Let us suppose we have gi G pre a (Q) and g 3 G" pre a (Q). 
The problem is that gi G pre a (Q) and gi ^ g 2 implies g 2 G pre a (Q). With g 2 g 3 
we also get g 3 G pre a (Q) which contradicts g 3 G" pre a (<3). 

• Let us suppose the existence of a bisimulation 5? included in M but not in . 
There are two states gi,g 2 G Q such that: q\S^ q 2 A -1 gi^g 2 . From 5? C 
we get q x M g 2 . With -1 q% g 2 and the definition of % there is a G E such that 
only one of {gi,g 2 } belongs to pre a (Q). Let us suppose we have q 1 G pre a (Q) and 
g 2 g" pre a (Q). With g x J^g 2 we get g 2 G ^ o pre a (Q). With the hypothesis that 5? 
is a simulation, we get g 2 G pre a o 5?{Q) and thus g 2 G pre a (Q), since 5^{Q) C Q, 
which contradicts g 2 G" pre a (Q). 

□ 

The conjunction of (jHJ) and of the trivial condition 3%{Q) = Q is the cornerstone of 
what follows. 

Definition 5. Let T = (Q, E, — >) be a LTS and M be an equivalence relation on Q. 

• A subset L of Q is said a potential splitter of M if 3i{V) = L, L is composed of at 
least two blocks of & and: Va G E . M o pre a (L) C pre a (L). 
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• a couple (Li, L 2 ) of two non empty subsets ofQ is said a splitter of M if: LiHL 2 = 0, 
St{L x ) = L x , S£(L 2 ) = L 2 and: VaeS.f o pre a (Li U L 2 ) C pre a (Li U L 2 ). 

The key element of the preceding definition is that a splitter does not depend on a 
specific letter. This is in sharp contrast to what is generaly found in the literature since 
[7j (to our knowledge, there is only one exception: pQ). The second difference with the 
literature is that our splitters are couples of sets of blocks. Indeed, in the literature, the 
second element of our splitters is hidden. However, its presence is essential as it allows 
us to split sequentially, with different letters, the current relation with the same splitter. 
Furthermore, it will give us more freedom in the choice of the splitter to use. 

From now on, we consider that T = (Q, E, — >) is a DLTS, M is an equivalence relation 
on Q and (Li, L 2 ) is a splitter of St. We define for X C Q: 



Split (X, 



\ lW(g, q'), (q\ q)eQxQ\q'e St(q) A q' <j£ X] 



Split (X, St) amounts to split all blocks C such that C fl X ^ 
blocks C\ 



CHX and C 2 = C\X, see Fig. 



la 



and C \ X ^ in two 
When in an equivalence relation St we 



just split some blocks C in two parts, the resulting relation is still an equivalence relation. 
Therefore, Split(X, M) is still an equivalence relation. Furthermore, a block of M whose 
all elements are in X or for which no element is in X is still a block in Split (X,, 



C 



! 1' 



Li 



Ci 



(a) From^ 1 to ^ . 



v i j 



Li 




(b) Contradiction of 
y (ZM and ,y % °7/. 



Li 



Figure 1: = Split(pre (Li), St). 
Blocks of are dashed, blocks of °l/ are dotted, pre a (Li) is in gray. 



and let us consider a block C of 3% which contains an 

" Since C A {L X UL 2 ), 



see Fig. l(a 



Let % = Split (pre (Li), 
element q in pre^Zq) and an element q' not in pre a (Z/! 
by definition of a splitter, we have C C pre a (Li U L 2 ) and thus q' is in pre a (L 2 ). This 
implies that Split(pre a (Li), ^) = Split (pre a (L 2 ), ^) and in no block of % there is a 
state belonging in pre a (Li) and another state not belonging in pre a (Li). Therefore, 
^ opre a (Lx) C pre a (L!) and, by symmetry, opre a (L 2 ) C pre a (L 2 ). This is illustrated 
by the right part of Fig. 1(a) 

Now, let us suppose there is a bisimulation 5? included in St but not included in , 



see Fig. 1(b) This means that there are two states q and q' of the same block of St 
such that q5? q', q G pre a (Li) and q' ^ pre a (Li). Let q% be the state of L\ such that 
q G pre a (<7i). We thus have q' G 5? o pre a (gi). With the fact that y is a simulation, we 
infer that q' G pre a o y{q\) which implies the existence of a state q[ such that q\ 5? q[ 
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and q' G pre a (g^). But, remember, we have 5? C M, q\ G L\ and M{L\) = L\. Therefore, 
q[ G L\ and g' G pre a (Li). Which contradicts q' ^ pre a (Li). 
From what precedes, we infer the following theorem. 

Theorem 6. Let T = (Q, S, — >) 6e a DLTS and (L\, L 2 ) be a splitter of ' an equivalence 
relation on Q. Let = Split(pre a (Li),^). Then, % is an equivalence relation and any 
bisimulation included in M is also included in % . Furthermore: % opre^-Z^) C pre a (Lx) 
and o pre a (L 2 ) C pre a (L 2 ). 



Let us come back to equation (TTJ and consider Fig. 1(a). We have & o pre a (Li) C 
pre a (Li) URemove a (L 1 ) with Remove^i^) = pre a (L 2 ) which clearly depends on a. With 
equation ([2]) we have: I ^ , opre a (L 1 ) C pre a (L! UNotRe^Lx)) with NotRel^) = L 2 which 
does not depend on any letter. This was our goal. 

From last theorem, the algorithm will maintain a set of potential splitters. At each 
main iteration, we choose L one of them. Then, we choose in L a block B, no matter 
which one. We set L\ to be the smallest set between B and L\B; we set L 2 to be the other 
one. Then, we iteratively split the current partition, with the non empty pre a (Li). The 
magical thing is that (L\, L 2 ) remains a splitter during these iterations even if some blocks 
in L\ or in L 2 are split. Management of the letters being aside, the resulting algorithm is 
quite similar to the one in [9]. However, knowing that the LTS being deterministic, there 
is no need of counters like in [9]. 



4 The Bisimulation Algorithm 



Function Init(T, P init ) with T = (Q, S, ->•) 

1 P := copy(Pi n it); forall a G £ do a. Pre := 0; 

2 forall q A q' E—t- do a. Pre := a. Pre U {q} 

3 forall oGE do (P, _) := Split(a.Pre, P); 

4 return (P) 



In the remainder of the paper, all LTS are finite and deterministic. Given a DLTS 
T = (Q, E, — >) and an initial partition P ini t of Q, inducing an equivalence relation ^i n it, 
the algorithm manages a set S of potentials splitters to iteratively refine the partition 
P initially equals to Pinit- At the end, P represents Pbi S the partition whose induced 
equivalence relation & his is the coarsest bisimulation over T included in ^i n i t . 

The partition P is a set of blocks. A block is assimilated with its set of states. The 
set S is a set of subsets of Q. We will see that S is a set of potential splitters. To each 
letter a G £ we associate a set of states noted a. Pre since it corresponds to pre a (Q) after 
the forall loop at line |5] of Init and to pTe a (Smaller) after the forall loop at line O of 
DBisim. 

The main function DBisim uses two others functions: Split, and Init. Function 
Split(X, P) splits each block C of P, having at least one element in X and another one 
not in X, in two blocks C\ = C D X and C 2 = C\X, and returns the resulting partition 
and the list of blocks C that have been split. It is mainly an implementation of function 
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Function DBisim (T, P init ) with T = (Q, E, ->•) 



1 P:= Init(T,P init )}; 

2 if |P| = 1 then return (P) /* nothing more has to be done */}; 

3 S := {Q}; alph : = 0; forall a G E do a.Pre := 0; 

4 while 3L E S do 

5 /* Assert : alph = A (Va G E . a.Pre = 0) */; 

6 Let 5 be any block of L; 

7 if iftere are on/t/ two blocks in L then S :— S\ {L} else L := L \ B; 

8 if |P| < \L\B\ then Smaller := B else Smaller := L \ B; 

9 forall g A gi G— )■ such that gi G Smaller do 
10 a/p/i := aip/i U {a}; a.Pre := a.Pre U {q}; 

n forall a G a/pa do 

12 (P, Splitted) := Split(a.Pre, P); 

13 forall C G Splitted do 

14 if C g U5 then S := 5 U {C}; 
is forall a G alph do a.Pre := 0; 

16 alph := 0; 

17 P bis := P; return (P bis ) 



Split(X, Mp) seen in the previous section. Function Init, also uses Split and returns 
the partition whose induced equivalence relation is Init Refine (^nit)- 

4.1 Correctness 

Let us first consider function Init(T, p n it). All line numbers in this paragraph refer to 
function Init. At line |2j we identify for each a G E the states which have an outgoing 
transition labelled by a. Then, at line [31 for each a G E we separate in all the blocks the 
states which have an outgoing transition labelled by a and the states that do not have 
an outgoing transition labelled by a. Clearly, the resulting partition corresponds to the 
relation Init Refine (Pmit)- 

Let us consider function DBisim(T, P init ). From now on, all line numbers refer to 
function DBisim. Let 5" = {B G P | B % US'}. We prove, by an induction, that the 
following property is an invariant of the while loop of DBisim. The relation M is the 
equivalence relation induced by the current partition P. 

L G S =>• M(L) = L and L contains at least two blocks of M . (4) 

Just before the execution of the while loop, S contains only one element: Q. Thanks 
to the test at line EJ Q is made of at least two blocks, and we obviously have ffl{Q) = Q, 
which satisfies property (jH). 

Let us assume 01]) is satisfied before an iteration of the while loop. The set S can be 
modified only at lines [7]and[TH Let L be the element of S chosen at lineSJ By induction 
hypothesis, L is composed of at least two blocks and t%(L) = L. If L is withdrawn from 
S, at line [TJ this is because L is composed of exactly two blocks that are implicitly added 
in S'. If L is not withdrawn from S then a block, B, is withdrawn from L which is 
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composed, induction hypothesis and condition of the if at line [JJ of at least three blocks. 
From the hypothesis that 3S(L) = L and from 3?(B) = B since B is a block, we also 
have M{L \ B) = L\B. Property (jl]) is therefore not modified by line UJ At line HH a 
block C which has been split at line H21 and thus implicitly withdrawn from S', is added 
into S. Since C has just been split, and thus contains two complete blocks, property fll]) 
is still true and thus is an invariant of the while loop. 
Now, let us consider the two following properties: 

5? is a bisimulation over T included in St-^ =>• C ^ . (5) 
^VoGS.f o pre a (L) C pre a (L) . (6) 

Thanks to function Init and Lemma HI these properties are satisfied before the while 
loop. Remember that S = {Q} and S' = at this moment. Let us suppose they are 
satisfied before an iteration of the while loop. Let L be the element of S chosen at lineHJ 
After line [8j L = L\ U L 2 with L\ = Smaller and L 2 = L \ Smaller. Since Smaller 
is a block and M(L) = L by fll]), we have: ffl{L\) = L\ and M{L 2 ) = L 2 . This implies 
that (Li,L 2 ) is a splitter of 3?. Note that, during the iteration, 3% can only be refined 
(by function Split, line[T2]). This implies that (Li,L 2 ) stays a splitter of 3$ during the 
iteration. Furthermore, after each iteration of the forall loop at line [TTJ from Theorem [HJ 
we have 5? C 3i since this is the case, by induction hypothesis, before the iteration. 
The fact that, 3% can only be refined also implies that property fl6]) is still satisfied after 
the iteration for all elements of S U S' different from L\ and L 2 . Let us consider their 
cases. Let 3? c be the value of 3? after the iteration of the forall loop at line [11] for a = c. 
Then, from Theorem [H] we have 3? c o pre^Lx) C pre^I^) and 3$ c o pre c (L 2 ) C pre c (L 2 ). 
At the end of the iteration of the while loop, we obviously have 3? C 3S C and thus: 
Va G alph . (<^opre a (Li) C pre a (Li) A^opre a (L 2 ) C pre a (L 2 )). Let a ^ alph this means 
that pre a (Li) = pre a (Smaller) = and thus 3?opre a (Li) C pre a (Li). Therefore, we have, 
with 3?' the value of 3? before the iteration: ^"opre a (L 2 ) = ^'opre a (L) which is included, 
by induction hypothesis of in pre a (L) = pre a (L 2 ) and thus 3? o pre a (L 2 ) C pre a (L 2 ) 
at the end of the iteration. In summary, properties (EJ) and (E]) are invariants of the while 
loop. 

We postpone to the next section the proof of termination of DBisim. This will be done 
by a complexity argument. For the moment, note that the execution of the while loop 
ends when S 1 = and thus, by definition of S', P as a set of blocs is included in 5". With 
([H]), all of that implies, with 3? his the last value of 3£: MB e PVa G £ . 3? his o pre a (S) C 
pre a (B). By Proposition [2] this means that bisimulation, by © it is the coarsest 

one included in 3?i n n. Furthermore, since 3?bis = 3?p his with Pbis a partition, the last value 
of P, it is an equivalence relation. 

4.2 Complexities 

Let A be a set of elements, we qualify an encoding of X as indexed if the elements of 
X are encoded in an array of \X\ slots, one for each element. Therefore, an element 
of X can be identified with its index in this array. Let T = [Q, >) be a LTS, an 
encoding of T is said normalized if the encodings of Q, £ and — > are indexed, a transition 
is encoded by the index of its source state, the index of its label and the index of its 
destination state, and if \Q\ and |S| are in 0(\— >\). If |E| is not in 0(\— >\), we can 



8 



restrict £ to its really used part £' = {a E £ | 3g, q' E Q . q q' E— >} whose size is less 
than |— y\. To do this, we can use hash table techniques, sort the set — > with the keys 
being the letters labelling the transitions, or more efficiently use a similar technique of 
the one we used in the algorithm to distribute a set of transitions relatively to its labels 
(see, as an example, the forall loop at line of DBisim). This is done in 0(|£| + |— >\) 
time and uses 0(|£|) space. We learned that this may be done in C> ( | — > | ) time, still 
with 0(|£|) space, by using a technique presented in |ITj and which is also called "weak 
sorting" according to pQ. If \Q\ is not in 0(|— >\) this means there are states that are 
not involved in any transition. In general, these states are ignored. In fact, just after 
our initialization phase done by function Init, these states are in blocks that will not 
changed during the execution of the algorithm. Therefore, we can also restrict Q to its 
useful part {q E Q | 3</ E Q 3a E £ . q A- q' E— > V q' A- q E— >} whose size is in 0(\— 
This is done like for E. 

All encodings of LTS in this section are assumed to be normalized. 

Let us assume the following hypotheses: 

• scanning the elements of one of the following sets is done in time proportional to 
its size: — >, a. Pre for any a E E, L an element of S, B a block of P, the set of 
transitions leading to an element L of S or to a block B of P, and Splitted. 

• Split(a.Pre, P) is executed in time proportional to the size of a. Pre. 

• function copy(P init ) is executed in time. 

• all the other individual instructions in functions Init and DBisim are done in 
constant time or amortized constant time. 

• all the data structures use only 0(\— >\) space. 

Let us consider the time complexity of function Init. The forall loop at lined] is done 
in time 0(|£|) and thus 0(\— >\) since T is supposed to be normalized. The forall loop 
at line [2] is done in time 0(\ — The forall loop at line [3] is also done in time 0(\— >\) 
since we have: £ ae s|a.Pre| < £ ae s|— >\ < |— >\- Therefore, Init is done in 0(\— 7-|)-time. 

Let us now consider function DBisim. Let us first remark that during an iteration 
of the forall loop at line [TTj \Splitted\ < |a.Pre|. Furthermore, during an iteration of 
the while loop, \alph\ and Eaga^la.Prel are less than the number of transitions scanned 
during the forall loop at line This implies that the time complexity of function 
DBisim is proportional to the overall number of transitions scanned at line[9j But thanks 
to line [8], each time a transition q — > q' is used at line M, <?i belongs to the set Smaller 
whose size is less than half of the size of the previous time. 

From all of this, we get the following theorem. 

Theorem 7. Let T = (Q, E, — >) be a DLTS and Pi n ; t be an initial partition of Q inducing 
an equivalence relation 8%-^- Function DBisim computes Pu s the partition whose corre- 
sponding equivalence relation is the coarsest bisimulation over T included in 
0(|-H log \Q\)-time and 0(|£| + |^| + \Q\)- space. 

The 0(|Q| + |E|) part in the space complexity of the theorem is due to the normal- 
ization of T as explained at the beginning of this sub section. 
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4.3 Implementation 

The data structures can be similar to what is classically used in minimization and bisim- 
ulation algorithms like those in [TJ [U] . However we do prefer some ideas found in [TTJ and 
rediscovered in [2] . Instead of a doubly linked list to represent the states of a block, both 
papers distribute the indexes of the states in an auxiliary array A such that states in a 
same block form a subarray of A. Furthermore, when a block C has to be split in C\ and 
C2, the states of C\ and C2 stay in the same subarray of A corresponding to C. The only 
modification is that states of C\ are put on the left side of that subarray and thus states 
of C2 on the right side. Therefore, to encode a block, we just have to memorize a left 
and a right index. A key advantage for the present paper, is that to represent an element 
L of S we also just have to memorize a left and a right index in A. Therefore, when 
the blocks of L are split, we do not have to update these variables. The other elements 
are classic: each state knows the block to which it belongs, each block and each element 
of S maintains its size, each block maintains a boolean to know whether it belongs to 
US' (this information is transmitted to the sub block when a block is split), S may be 
encoded by a file or a by a stack. To choose a block in L at line [6] we just choose the left 
or the right block in the subarray of A corresponding to L. Thus, it is easy to perform 
the instruction L := L\B in line [7] and to scan the elements of Smaller. The transitions 
are also initially sorted by a counting sort in order to have the transitions leading to the 
same state form a subarray of the array of all the transitions. This allows us to scan the 
transitions leading to a given state in time proportional to their number. 

The variable Splitted is used just for the clarity of the presentation. Indeed, when a 
block C not in one of the potential splitters of S is split, this is detected during the call 
of function Split and C is directly added into S. 

The implementation of function Split (X,P) is not original (see for example [21 Sect. 
6] where a version which also returns the blocks in X is given). The time complexity of 
a call is 0(|X|) when the elements of X may be scan in time proportional to its size. 

In [21 Sect. 6] an encoding, satisfying the hypotheses, of alph and the a. Pre is given. 
However, their role is just to distribute the set of transitions leading to Smaller in 
function of their label. Then, for each label found, we scan the corresponding transitions 
to do the split. The distribution of the transitions can simply be done by a kind of 
counting sort. It runs in 0(\Smaller\)-txme and uses 0(\— )-|)-space. The "weak sorting" 
technique used in [1] can also be used. 

In summary, the data structures that we use satisfy the hypotheses given at the second 
paragraph of Sect. 14.21 under the assumption of a normalized DLTS. 

5 Main Application and Future Works 

The main application of the coarsest bisimulation problem over finite DLTS is the mini- 
mization of deterministic automata (LTS with the precision of an initial state and a set 
of final states). It is well known that, when there is no useless state (a state q is useless 
if there is no path from the initial state to q or if there is no path from q to a final state) 
this amounts of finding the coarsest bisimulation included in the following equivalence 
relation: M^ t = {(q, q') G Q x Q | q G F <^> q' G F} with F C Q the set of final states. 
The blocks of this relation are: F, if it is not empty (but in that case the minimal au- 
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tomata is the empty one), and (Q \ F) if it is also not empty. Therefore, the algorithm 
presented in the present paper can be used to minimize a deterministic automata with 
the complexities announced. 

Although this was not the purpose, a good piece of news is that DBisim is very similar 
to the algorithm in [9J. The main difference being that our LTS are deterministic. An 
extension of the present paper to the coarsest bisimulation problem over LTS will be 
done. This will yield an algorithm with the same time and space complexities of [10] but 
simpler. 
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